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Abstract —The main proteinase of SARS-associated coronavirus (SARS-CoV) plays an important role in viral transcription and 
replication, and is an attractive target for anti-SARS drug development. The important thing is to understand its binding mech¬ 
anism with possible ligands. In this study, we investigated possible noncanonical interactions, potential inhibitors, and binding 
pockets in the main proteinase of SARS-CoV based on its recently determined crystal structure. These findings provide a wide clue 
to searching for anti-SARS drug. Interestingly, we found that similar structure patterns exist in SARS-CoV main proteinase with 
Poliovirus 3c Proteinase, Rhinovirus 3c Protease, Nsp4 Proteinase From Equine Arteritis Virus, Hepatitis C Virus Ns3 Protease, 
Hepatitis A Virus 3c Protease, and Dengue Virus Ns3 Protease. It suggests that the available drugs in these viruses could be used to 
fight SARS disease. 

© 2004 Elsevier Ltd. All rights reserved. 


1. Introduction 

Since November in 2002, a highly contagious pneumo¬ 
nia, severe acute respiratory syndrome (SARS), has 
spread rapidly in Asia, North America, and Europe. A 
new SARS coronavirus (SARS-CoV) has been identified 
as the etiological agent of the disease. Its seriousness lies 
in rapid transmission and high fatality (around 15%). 
However, the origin of SARS-CoV is still unknown, and 
no effective drug or vaccine is available up to now. 

The SARS-CoV replicase encodes two overlapping 
polyproteins, ppla and pplb, which mediate viral rep¬ 
lication and transcription. While a special main pro¬ 
teinase, 3C-like proteinase, is responsible for the 
cleavage of polyproteins, its functional importance 
make it an attractive target for drug development. 
Fortunately, the crystal structures of SARS-CoV main 
proteinase has been determined, 1,2 which adopt a serine- 
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protease fold and is homologous to the main proteinases 
from human coronavirus and transmissible gastroen¬ 
teritis virus. 3 The goal of this study is to find potential 
inhibitors and locate the ligand-binding sites in SARS- 
CoV main proteinase based on comparison of nonho- 
mologous tertiary structures, hence to provide clues to 
rational drug design. 


2. Materials and methods 

The atomic coordinates of SARS-CoV main proteinase 
were downloaded from Protein Data Bank (ID 1Q2W). 
NCI program 4 was used to identify noncanonical 
interactions. VAST (http://www.ncbi.nlm.nih.gov/ 
Structure/VAST/vastsearch.html) and DALI (http:// 
www.ebi.ac. uk/dali/) programs were used to find similar 
structure patterns and in the main proteinase structure 
of SARS-CoV. The structure alignment was done by 
CE 5 and the structural comparison was performed by 
LGA. 6 The constituents of the binding pocket are 
determined by those residues that have at least one 
heavy atom (other than hydrogen) with a distance less 
than 5 A from a heavy atom of inhibitor, as did in Chou 
et al. 7 The visualization of 3D structure was generated 
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by PROTEINEXPLORER (http://www.proteinex- 
plorer.org). 


3. Results and discussion 

The noncanonical interactions in SARS-CoV main 
proteinase structures are shown in Figure 1. There are 
two pairs of main chain-side chain interactions: Glu288 
(donor) and Trp 207 (acceptor), Ilel 52 (donor) and 
Phe8 (acceptor). There are four pairs of side chain-side 
chain interactions: Arg40 (donor) and Tyr54 (acceptor), 
Arg298 (donor) and Phe8 (acceptor), Pro 184 (donor) 



Figure 1 . Noncanonical interactions in the structure of SARS-CoV 
main proteinase. (A) The residue pairs involved are: Arg298 and Phe 8, 
Glu288 and Trp207, and Ilel52 and Phe8, which are colored blue. (B) 
The residue pairs involved include: Arg40 and Tyr54, Pro 184 and 
Phel85, and Tyrl26 and Phel40, which are colored blue. The Cys-His 
catalytic dyad (Cysl45 and His41) are colored green. 


and Phel85 (acceptor), and Tyrl26 (donor) and 
Phe 140 (acceptor). These residues are marked as ‘@’ in 
Figure 2. 

Among these interactions, Phe8 accepts two N-H- • • n 
bonds in a sandwich fashion, one donated by a side-side 
chain Arg298, and one donated by a main-side chian 
Ilel52, as existed in human racl. 8 Taken together with 
another N-H • • n interaction between Glu288 and 
Trp207, these noncanonical bindings connected N-ter- 
minus and C-terminus of the enzyme together, then 
fixed to He 152 (domain II) and Trp207 (domain III), this 
makes the domain II and III not flexible due to the loop 
formed by above interactions in the right and a loop 
already existed in the left (Fig. 1A), that is stabilizes the 
structure of the protease. Furthermore, we turn to 
examine the remaining three pairs of interactions: Arg40 
and Tyr54, Prol84 and Phel85, and Tyrl26 and Phel40. 
It can be seen from Figure IB that these noncanonical 
bindings will be able to stabilize the protein structure 
around the active center formed by Cys-His catalytic 
dyad (Cysl45 and His41). On the other hand, after 
binding, the small helix Arg40 and His41 locate will 
expect to move up, the loop Phe 140 and Cysl45 locate 
and the loop Pro 184 and Phe 185 locate will expect to 
move down, hence make the active center more open for 
substrate binding (Fig. IB). These results can be used for 
rational design of mutagenesis experiments and analysis 
of conservation of interactions at functional sites. In 
recent years, the noncanonical interactions have been 
shown to be important for the stability of protein 
structure 9 11 and ligand recognition. 12 

The structurally similar nonhomologous virus proteins 
identified by VAST and DAFI methods focus on 6 
proteases: 1CQQ (Rhinovirus 3c Protease With Ag7088 
Inhibitor), 1F1N (Poliovirus 3c Proteinase), 1MBM 
(Nsp4 Proteinase From Equine Arteritis Virus), 1DY8 
(Hepatitis C Virus Ns3 Protease with inhibitor FKI), 
1QA7 (Hepatitis A Virus 3c Protease with inhibitor 
NFA) and 1DF9 (Dengue Virus Ns3-Protease Com- 
plexed With Mung-Bean Bowman-Birk Inhibitor). The 
structure-based sequence alignment of the above six 
proteases with SARS-CoV main protease is shown in 
Figure 2. The results demonstrate that a number of 
similar beta sheets (marked by b) exist. In particular, 
there are three similar structural patterns showing con¬ 
servative residues (bold representation). The catalytical 
important residues are included in the first two highly 
conservative patterns: His-Cys or His-Ser catalytic 
dyads. 2,13 18 In the third pattern, Gly is conservative in 6 
proteases except SARS-CoV main proteinase, the sig¬ 
nificance of mutation from Gly 161 to Tyrl61 in SARS is 
unclear. Anyway, the three structural patterns form the 
common active center of these seven enzymes, as seen in 
their superpositions (Fig. 3). 

Since these nonhomologous virus proteases share the 
same active center, we can consider the possibility of 
applying the inhibitors from these enzymes to SARS- 
CoV main proteinase. In fact, Anand et al. 3 has indi¬ 
cated that the inhibitor AG7088 may be modified 
to make it useful for SARS therapy. Chou et al. 7 
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1Q2W:A 
1L1N:A 
1CQQ:A 
1MBM:B 
1DY8:A 
1QA7:A 
1DF9:A 


@ # @# ### @ 

0 AFPSGKVEGCMVQVTCG-TTTLNGLWLD-DTVYCPRHVIC-TAEDMLNPNYEDLLIRKSNH 

0 DYAVAMAKRNIVTATTS-KGEFTMLGVH-DNVAILPTHASP-GE 

0 -FGMSLIKHNSCVITTE-NGKFTGLGVY-DRFVVVPTHAD-PG 

0 NVGFVAG-SSYGTGSVWTRNNEVWLTASHVVGRAN-MA 

0-DGE VQVLSTA-TQS FLATCVN-G - VCWTVYHGAG-SK 

0 LEIAGLVRKNLVQFGVGEKNGSVRWVMNALGVK- -DDWLLVPSHAYKFEKDYE-MME 

0-RIKQKGILGY-SQIGAGVYKE-G - TFHTMWHVT-RGA 

bbbbbb bbbbbbb bbbbb 


1Q2W:A 
1L1N:A 
1CQQ:A 
1MBM:B 
1DY8:A 
1QA7:A 
1DF9:A 


70 SFLVQAGNVQL-RVIGHSMQ-NCLLRLKV DTSNPKTPK- - YKFVRI QPGQTFSVL 

70 SIVID--GKEV-ElLDAKALEDQAGTNLEITIITL-KRNEKFRDIRPHIPTQI-TETNDGVLI 

70 KEIQV-DGITT-KVIDSYDLYNKNGIKLEITVLKL-DRNEKFRDIRRYIPNNE-DDYPNCNLA 

70 TLKIG- -DAML-TL-TFKKN-GDFAEAVTTQSELPGNWPQLH- - FA- -Q PTTGPASWC 

70 TL-AGPKGPITQMYTNV-DQDLVGWPA-PPGARSMT--PCTCG-SSD--LYLV 

70 FYFNRGGTYYS - ISAGNVVIQSLDV GAQDWLMKV- - PTIPKFRDITQHFIKKGDVPRALNRLATLV 

70 VLMHK-GKR-IEPSWADV KKDLVSC-GGGWKLE--GEWKE-GEE- -VQVL 

bb bbb bbb bbbbb bbbb 


1Q2W:A 
1L1N:A 
1CQQ:A 
1MBM:B 
1DY8:A 
1QA7:A 
1DF9:A 


## # @ @ #### 

14 0 ACYN-GSPSGVYQCAMRPN-HTIK-GSFLNGSCGSVGFN 

14 0 VNTSKYPNMYVPVGAVTEQGYLNL-GGRQTARTLMYN-FPTRAGQCGGVITC 

14 0 LLANQPEPTIINVGDVVSYGNILL-SGNQTARMLKYS-YPTKSGYCGGVLYK 

14 0 TAT-GDEEGLLSGE-VCL-AWTTSGDSGSAWQ 

14 0 TR-HADVIPVRRRG-DSRGSL-LSPRPVSYLKGSSGGPLLC 

14 0 TTVNG- TPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAWRGK-GEGLPGMCGGALVS 

14 0 AL-EPGKN-PRAVQ-TKPGLFKTNAGTIGAVSLDFSPGTSGSPIID 

b bbbbbbb b bbbb 


1Q2W:A 
1L1N:A 
1CQQ:A 
1MBM:B 
1DY8:A 
1QA7:A 
1DF9:A 


@ ## # 

210 IDYDCV-SFCYMHHMEL-PTGVHAGTDL- 

210 TG-KVIGMHVGGN-GSHGFAAALKRSYFT 

210 IG-QVLGIHVGGN-GRDGFSAMLLRSYFT 

210 GD-AVVGVHTGSN-TSGVAYVTTP- 

210 PS-GHVVGIFRAAVCTRGVAKAVDFIPVESM-- 

210 SNQSIQNAILGIHVAGG-NSILVAKLVTQEMFQ 

210 KKG-KVVGIYGNGVVTRSGAYVSAIAQTEKSI- 

bbbbbbbbb bbbbb 


Figure 2. Structure alignment between SARS-CoV main proteinase (1Q2W) and other proteases: 1L1N (Poliovirus 3c Proteinase), 1CQQ (Rhi- 
novirus 3c Protease), 1MBM (Nsp4 Proteinase From Equine Arteritis Virus), 1DY8 (Hepatitis C Virus Ns3 Protease), 1QA7 (Hepatitis A Virus 3c 
Protease), and 1DF9 (Dengue Virus Ns3-Protease). The residues marked as ‘b’ indicate similar beta-sheets. The bold residues indicate structurally 
similar patterns. The residues marked as ‘@’ make noncanonical interactions. The residues marked as ‘#’ make contact with inhibitors. 



Figure 3. The superposition of SARS-CoV main proteinase (1Q2W, 
white) with other proteases: 1L1N (Poliovirus 3c Proteinase, red), 
1CQQ (Rhinovirus 3c Protease, orange), 1MBM (Nsp4 Proteinase 
From Equine Arteritis Virus, yellow), 1DY8 (Hepatitis C Virus Ns3 
Protease, blue), 1QA7 (Hepatitis A Virus 3c Protease, gray), and 1DF9 
(Dengue Virus Ns3-Protease, green). 


conducted the docking studies of KZ7088 (a derivative 
of AG7088) to the SARS-CoV main proteinase (based 
on a theoretical model). Here, we studied the possibility 
of other inhibitors in anti-SARS drug development. 
Figure 4 shows the structures of docking other inhibi¬ 
tors from HAV, HCV and Dengue virus to SARS-CoV 
main proteinase, the residues that make contact with 
these inhibitors are marked as c #’ in Figure 2. Further¬ 
more, the corresponding binding pockets are shown in 
Table 1. The results reveal that these inhibitors can bind 
to SARS-CoV main proteinase well. 

Thus, the results above suggest that there is a link of 
SARS-CoV to human rhinovirus, Poliovirus, Arteritis 
virus, HAV, HCV, and Dengue virus. The inhibi¬ 
tors from these viruses could be modified to make them 
useful for SARS therapy. In fact, Cinatl et al. 19 
reported that human interferons, a medicine widely used 
in HCV patients, was useful for the treatment of 
SARS. Anyway, whether relationships between 
SARS-CoV and these pathogens exist or not, the present 
results provide a wide clue to search for anti-SARS 
inhibitors. 
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Figure 4. The interactions between SARS-CoV main protease (1Q2W, 
white cartoon) with different inhibitors, binding pockets are repre¬ 
sented by blue ball-stick. (A) FKI inhibitor from Hepatitis C Virus 
Ns3 Protease (yellow spacefill). (B) NFA inhibitor from Hepatitis A 
Virus 3c Protease (yellow spacefill). (C) Bowman-Birk inhibitor from 
Dengue Virus Ns3-Protease (yellow stick). 


Table 1. Binding pockets of SARS-CoV main proteinase for different 
inhibitors 


FKI inhibitor 
from HCV 

NFA 
inhibitor 
from HAV 

Bowman-Birk inhibitor from 
Dengue vims 

VAL 

20 MET 

17 GLN 

19 PHE 

140 

THR 

25 VAL 

18 VAL 

20 LEU 

141 

THR 

26 GLN 

19 THR 

21 ASN 

142 

LEU 

27 VAL 

20 CYS 

22 GLY 

143 

ASN 

28 THR 

21 GLY 

23 SER 

144 

PRO 

39 THR 

24 THR 

24 CYS 

145 

ARG 

40 THR 

25 THR 

25 GLY 

146 

HIS 

41 THR 

26 HR 

26 SER 

147 

VAL 

42 LEU 

27 LEU 

27 TYR 

161 

CYS 

44 ASN 

28 ASN 

28 MET 

162 

CYS 

117 GLY 

29 PRO 

39 HIS 

163 

TYR 

118 HIS 

41 ARG 

40 HIS 

164 

ASN 

119 VAL 

42 HIS 

41 MET 

165 

SER 

139 GLN 

69 VAL 

42 GLU 

166 

PHE 

140 ALA 

116 ILE 

43 LEU 

167 

LEU 

141 CYS 

117 CYS 

44 PRO 

168 

ASN 

142 TYR 

118 MET 

49 THR 

169 

GLY 

143 ASN 

119 LEU 

50 GLY 

170 

SER 

144 GLY 

120 ASN 

51 VAL 

171 

CYS 

145 SER 

121 PRO 

52 HIS 

172 

GLY 

146 PRO 

122 ASN 

53 ALA 

173 

SER 

147 LEU 

141 TYR 

54 GLY 

174 

TYR 

161 ASN 

142 LEU 

57 PHE 

181 

HIS 

163 GLY 

143 VAL 

114 PRO 

184 

HIS 

164 SER 

144 CYS 

117 PHE 

185 

MET 

165 CYS 

145 TYR 

118 VAL 

186 

GLU 

166 GLY 

146 ASN 

119 ASP 

187 

LEU 

167 SER 

147 TYR 

126 ARG 

188 

VAL 

171 

THR 

135 GLN 

189 

HIS 

172 

ILE 

136 THR 

190 

ALA 

173 

LYS 

137 ALA 

191 

PHE 

181 

GLY 

138 GLN 

192 

ASP 

187 

SER 

139 ALA 

193 




ALA 

194 
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